Yellowstone County
What's the Deal with U.F.O.s?
When I was growing up, I watched a lot of sci-fi movies about aliens that come to Earth. The extraterrestrials in popular culture, however, always looked so familiar that I found them far-fetched. What are the chances that E.T., the Predator, or ALF would develop arms and legs, a humanlike face, and opposable thumbs? Perhaps as a result, I associated alien life more with fantasy than with science, and I never gave much thought to what a visit would really look like. But my attitude started to change in 2020, when I read Liu Cixin's "The Three-Body Problem" and its two sequels.
- North America > United States > Montana > Yellowstone County > Billings (0.05)
- North America > United States > Illinois > Cook County > Chicago (0.05)
- North America > United States > California (0.05)
- Government > Regional Government > North America Government > United States Government (0.30)
- Government > Military (0.30)
WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Tu, Shangqing, Sun, Yuliang, Bai, Yushi, Yu, Jifan, Hou, Lei, Li, Juanzi
To mitigate the potential misuse of large language models (LLMs), recent research has developed watermarking algorithms, which restrict the generation process to leave an invisible trace for watermark detection. Due to the two-stage nature of the task, most studies evaluate the generation and detection separately, thereby presenting a challenge in unbiased, thorough, and applicable evaluations. In this paper, we introduce WaterBench, the first comprehensive benchmark for LLM watermarks, in which we design three crucial factors: (1) For \textbf{benchmarking procedure}, to ensure an apples-to-apples comparison, we first adjust each watermarking method's hyper-parameter to reach the same watermarking strength, then jointly evaluate their generation and detection performance. (2) For \textbf{task selection}, we diversify the input and output length to form a five-category taxonomy, covering $9$ tasks. (3) For \textbf{evaluation metric}, we adopt the GPT4-Judge for automatically evaluating the decline of instruction-following abilities after watermarking. We evaluate $4$ open-source watermarks on $2$ LLMs under $2$ watermarking strengths and observe the common struggles for current methods on maintaining the generation quality. The code and data are available at \url{https://github.com/THU-KEG/WaterBench}.
- Africa > Ghana (0.05)
- Oceania > Australia (0.04)
- North America > United States > Texas (0.04)
- (26 more...)
- Personal (0.92)
- Research Report > New Finding (0.46)
- Materials > Metals & Mining > Gold (1.00)
- Information Technology > Security & Privacy (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Accurate Measures of Vaccination and Concerns of Vaccine Holdouts from Web Search Logs
Chang, Serina, Fourney, Adam, Horvitz, Eric
To design effective vaccine policies, policymakers need detailed data about who has been vaccinated, who is holding out, and why. However, existing data in the US are insufficient: reported vaccination rates are often delayed or missing, and surveys of vaccine hesitancy are limited by high-level questions and self-report biases. Here, we show how large-scale search engine logs and machine learning can be leveraged to fill these gaps and provide novel insights about vaccine intentions and behaviors. First, we develop a vaccine intent classifier that can accurately detect when a user is seeking the COVID-19 vaccine on search. Our classifier demonstrates strong agreement with CDC vaccination rates, with correlations above 0.86, and estimates vaccine intent rates to the level of ZIP codes in real time, allowing us to pinpoint more granular trends in vaccine seeking across regions, demographics, and time. To investigate vaccine hesitancy, we use our classifier to identify two groups, vaccine early adopters and vaccine holdouts. We find that holdouts, compared to early adopters matched on covariates, are 69% more likely to click on untrusted news sites. Furthermore, we organize 25,000 vaccine-related URLs into a hierarchical ontology of vaccine concerns, and we find that holdouts are far more concerned about vaccine requirements, vaccine development and approval, and vaccine myths, and even within holdouts, concerns vary significantly across demographic groups. Finally, we explore the temporal dynamics of vaccine concerns and vaccine seeking, and find that key indicators emerge when individuals convert from holding out to preparing to accept the vaccine.
- North America > United States > New York > New York County > New York City (0.14)
- North America > United States > New York > Bronx County (0.14)
- North America > United States > Wyoming (0.04)
- (23 more...)
- Health & Medicine > Therapeutic Area > Vaccines (1.00)
- Health & Medicine > Therapeutic Area > Immunology (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
- Information Technology > Information Management > Search (1.00)
- Information Technology > Data Science (1.00)
- Information Technology > Communications (1.00)
- (2 more...)
AI may be on its way to your doctor's office, but it's not ready to see patients
What use could healthcare have for someone who makes things up, can't keep a secret, doesn't really know anything, and, when speaking, simply fills in the next word based on what's come before? Lots, if that individual is the newest form of artificial intelligence, according to some of the biggest companies out there. Companies pushing the latest AI technology -- known as "generative AI" -- are piling on: Google and Microsoft want to bring types of so-called large language models to healthcare. Big firms that are familiar to folks in white coats -- but maybe less so to your average Joe and Jane -- are equally enthusiastic: Electronic medical records giants Epic and Oracle Cerner aren't far behind. The space is crowded with startups, too.
- North America > United States > Montana > Yellowstone County > Billings (0.05)
- North America > United States > California > San Diego County > San Diego (0.05)
- Asia > Middle East > Israel (0.05)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.56)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.38)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.36)
Ask Me Anything: A simple strategy for prompting language models
Arora, Simran, Narayan, Avanika, Chen, Mayee F., Orr, Laurel, Guha, Neel, Bhatia, Kush, Chami, Ines, Sala, Frederic, Ré, Christopher
Large language models (LLMs) transfer well to new tasks out-of-the-box simply given a natural language prompt that demonstrates how to perform the task and no additional training. Prompting is a brittle process wherein small modifications to the prompt can cause large variations in the model predictions, and therefore significant effort is dedicated towards designing a painstakingly "perfect prompt" for a task. To mitigate the high degree of effort involved in prompt-design, we instead ask whether producing multiple effective, yet imperfect, prompts and aggregating them can lead to a high quality prompting strategy. Our observations motivate our proposed prompting method, ASK ME ANYTHING (AMA). We first develop an understanding of the effective prompt formats, finding that question-answering (QA) prompts, which encourage open-ended generation ("Who went to the park?") tend to outperform those that restrict the model outputs ("John went to the park. Output True or False."). Our approach recursively uses the LLM itself to transform task inputs to the effective QA format. We apply the collected prompts to obtain several noisy votes for the input's true label. We find that the prompts can have very different accuracies and complex dependencies and thus propose to use weak supervision, a procedure for combining the noisy predictions, to produce the final predictions for the inputs. We evaluate AMA across open-source model families (e.g., EleutherAI, BLOOM, OPT, and T0) and model sizes (125M-175B parameters), demonstrating an average performance lift of 10.2% over the few-shot baseline. This simple strategy enables the open-source GPT-J-6B model to match and exceed the performance of few-shot GPT3-175B on 15 of 20 popular benchmarks. Averaged across these tasks, the GPT-J-6B model outperforms few-shot GPT3-175B. We release our code here: https://github.com/HazyResearch/ama_prompting
- North America > United States > New Jersey (0.14)
- Africa > Middle East > Libya (0.14)
- North America > United States > California > Santa Clara County > Palo Alto (0.14)
- (83 more...)
- Research Report (1.00)
- Personal (0.92)
- Transportation > Passenger (1.00)
- Transportation > Ground (1.00)
- Transportation > Air (1.00)
- (20 more...)
Bosch Drawing Lessons From Autonomous Car Pilot Program in San Jose Digital Trends
The companies racing to deploy autonomous cars on the world's roads took a reality check in the 2010s, but multimillion-dollar development efforts remain ongoing across the automotive and tech industries. German supplier Bosch is notably moving full speed ahead with its quest to make driverless cars a reality. Kay Stepper, Bosch's senior vice president of automated driving, sat down with Digital Trends to talk about the state of autonomous driving in 2020, and what's next for the artificial intelligence technology that powers the prototypes it's testing. Bosch has never made a car, so it brings its innovations to the market through partnerships with automakers. It chose Mercedes-Benz parent company Daimler to test autonomous technology in real-world conditions via a ridesharing pilot program in San Jose, California, close to one of the company's research centers.
- North America > United States > California > Santa Clara County > San Jose (0.25)
- North America > United States > Montana > Yellowstone County > Billings (0.05)
- Europe > France (0.05)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
- Transportation > Passenger (1.00)
- Transportation > Ground > Road (1.00)
- Information Technology > Robotics & Automation (1.00)
- Automobiles & Trucks (1.00)
Sparse Learning for Variable Selection with Structures and Nonlinearities
In this thesis we discuss machine learning methods performing automated variable selection for learning sparse predictive models. There are multiple reasons for promoting sparsity in the predictive models. By relying on a limited set of input variables the models naturally counteract the overfitting problem ubiquitous in learning from finite sets of training points. Sparse models are cheaper to use for predictions, they usually require lower computational resources and by relying on smaller sets of inputs can possibly reduce costs for data collection and storage. Sparse models can also contribute to better understanding of the investigated phenomenons as they are easier to interpret than full models.
- North America > United States > Connecticut (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- South America > Uruguay > Maldonado > Maldonado (0.04)
- (8 more...)
- Banking & Finance (1.00)
- Government (0.67)
Views of AI, robots, and automation based on internet search data
Artificial intelligence, robots, and automation are rising in importance in many areas. As noted in the recent book, "The Future of Work: Robots, AI, and Automation," there are exciting advances in finance, transportation, national defense, smart cities, and health care, among other areas. Businesses are developing solutions that improve the efficiency and effectiveness of their operations and using these tools to improve the way their firms function. Yet there also are concerns about the impact of these developments on jobs and personal privacy. A Pew Research Center national survey revealed considerable unease about emerging trends.
- North America > United States > California > San Francisco County > San Francisco (0.15)
- Asia > China (0.06)
- North America > United States > Virginia > Albemarle County > Charlottesville (0.05)
- (16 more...)
- Transportation > Ground > Road (1.00)
- Information Technology (1.00)
- Automobiles & Trucks (1.00)
- Government > Regional Government (0.70)
Learning Predictive Leading Indicators for Forecasting Time Series Systems with Unknown Clusters of Forecast Tasks
Gregorova, Magda, Kalousis, Alexandros, Marchand-Maillet, Stephane
We present a new method for forecasting systems of multiple interrelated time series. The method learns the forecast models together with discovering leading indicators from within the system that serve as good predictors improving the forecast accuracy and a cluster structure of the predictive tasks around these. The method is based on the classical linear vector autoregressive model (VAR) and links the discovery of the leading indicators to inferring sparse graphs of Granger causality. We formulate a new constrained optimisation problem to promote the desired sparse structures across the models and the sharing of information amongst the learning tasks in a multi-task manner. We propose an algorithm for solving the problem and document on a battery of synthetic and real-data experiments the advantages of our new method over baseline VAR models as well as the state-of-the-art sparse VAR learning methods.
- North America > United States > California (0.14)
- North America > United States > Connecticut (0.05)
- North America > United States > Montana > Yellowstone County > Billings (0.04)
- (4 more...)